Kings County
FinQAPT: Empowering Financial Decisions with End-to-End LLM-driven Question Answering Pipeline
Singh, Kuldeep, Kaur, Simerjot, Smiley, Charese
Financial decision-making hinges on the analysis of relevant information embedded in the enormous volume of documents in the financial domain. To address this challenge, we developed FinQAPT, an end-to-end pipeline that streamlines the identification of relevant financial reports based on a query, extracts pertinent context, and leverages Large Language Models (LLMs) to perform downstream tasks. To evaluate the pipeline, we experimented with various techniques to optimize the performance of each module using the FinQA dataset. We introduced a novel clustering-based negative sampling technique to enhance context extraction and a novel prompting method called Dynamic N-shot Prompting to boost the numerical question-answering capabilities of LLMs. At the module level, we achieved state-of-the-art accuracy on FinQA, attaining an accuracy of 80.6%. However, at the pipeline level, we observed decreased performance due to challenges in extracting relevant context from financial reports. We conducted a detailed error analysis of each module and the end-to-end pipeline, pinpointing specific challenges that must be addressed to develop a robust solution for handling complex financial tasks.
A Neural Matrix Decomposition Recommender System Model based on the Multimodal Large Language Model
Xiang, Ao, Huang, Bingjie, Guo, Xinyu, Yang, Haowei, Zheng, Tianyao
The challenge of finding content that aligns with users' interests within this abundance has become increasingly important. Recommender systems play a crucial role in addressing this issue, as they have the potential to provide precise recommendations that enhance user experience and save time in commercial applications [1]. These systems predict user ratings for specific items by employing data mining techniques and related predictive algorithms to make highly relevant predictions. By analyzing user historical behavior, preferences, and item characteristics, recommender systems effectively solve the information filtering problem by automatically matching items that may be of interest to users. Traditional recommender systems primarily consist of collaborative filtering [2], content-based recommendations [3], and hybrid recommendation methods, among which collaborative filtering is one of the earliest and most widely used techniques for recommending products or items based on past purchasing history.
The Machine Ethics podcast: AI fictions with Alex Shvartsman
Hosted by Ben Byford, The Machine Ethics Podcast brings together interviews with academics, authors, business leaders, designers and engineers on the subject of autonomous algorithms, artificial intelligence, machine learning, and technology's impact on society. This episode we're chatting with Alex Shvartsman about our AI future, human crafted storytelling, the generative AI use backlash, disclaimers for generated text, human vs AI authorship, practical or functional goals of LLMs, changing themes in science fiction, a diversity of international perspectives and moreโฆ Alex Shvartsman resides in Brooklyn, New York, and is the author of Kakistocracy (2023), The Middling Affliction (2022), and Eridani's Crown (2019) fantasy novels. Over 120 of his stories have appeared in Analog, Nature, Strange Horizons, etc. He won the WSFA Small Press Award for Short Fiction and was a three-time finalist for the Canopus Award for Excellence in Interstellar Fiction. His translations from Russian have appeared in F&SF, Clarkesworld, Tor.com, Analog, Asimov's, etc. Alex has edited over a dozen anthologies, including the long-running Unidentified Funny Objects series.
T-Explainer: A Model-Agnostic Explainability Framework Based on Gradients
Ortigossa, Evandro S., Dias, Fรกbio F., Barr, Brian, Silva, Claudio T., Nonato, Luis Gustavo
The development of machine learning applications has increased significantly in recent years, motivated by the remarkable ability of learning-powered systems to discover and generalize intricate patterns hidden in massive datasets. Modern learning models, while powerful, often exhibit a level of complexity that renders them opaque black boxes, resulting in a notable lack of transparency that hinders our ability to decipher their decision-making processes. Opacity challenges the interpretability and practical application of machine learning, especially in critical domains where understanding the underlying reasons is essential for informed decision-making. Explainable Artificial Intelligence (XAI) rises to meet that challenge, unraveling the complexity of black boxes by providing elucidating explanations. Among the various XAI approaches, feature attribution/importance XAI stands out for its capacity to delineate the significance of input features in the prediction process. However, most existing attribution methods have limitations, such as instability, when divergent explanations may result from similar or even the same instance. In this work, we introduce T-Explainer, a novel local additive attribution explainer based on Taylor expansion endowed with desirable properties, such as local accuracy and consistency, while stable over multiple runs. We demonstrate T-Explainer's effectiveness through benchmark experiments with well-known attribution methods. In addition, T-Explainer is developed as a comprehensive XAI framework comprising quantitative metrics to assess and visualize attribution explanations.
Statistical Mechanics and Artificial Neural Networks: Principles, Models, and Applications
Bรถttcher, Lucas, Wheeler, Gregory
The field of neuroscience and the development of artificial neural networks (ANNs) have mutually influenced each other, drawing from and contributing to many concepts initially developed in statistical mechanics. Notably, Hopfield networks and Boltzmann machines are versions of the Ising model, a model extensively studied in statistical mechanics for over a century. In the first part of this chapter, we provide an overview of the principles, models, and applications of ANNs, highlighting their connections to statistical mechanics and statistical learning theory. Artificial neural networks can be seen as high-dimensional mathematical functions, and understanding the geometric properties of their loss landscapes (i.e., the high-dimensional space on which one wishes to find extrema or saddles) can provide valuable insights into their optimization behavior, generalization abilities, and overall performance. Visualizing these functions can help us design better optimization methods and improve their generalization abilities. Thus, the second part of this chapter focuses on quantifying geometric properties and visualizing loss functions associated with deep ANNs.
Stackelberg Meta-Learning Based Shared Control for Assistive Driving
Shared control allows the human driver to collaborate with an assistive driving system while retaining the ability to make decisions and take control if necessary. However, human-vehicle teaming and planning are challenging due to environmental uncertainties, the human's bounded rationality, and the variability in human behaviors. An effective collaboration plan needs to learn and adapt to these uncertainties. To this end, we develop a Stackelberg meta-learning algorithm to create automated learning-based planning for shared control. The Stackelberg games are used to capture the leader-follower structure in the asymmetric interactions between the human driver and the assistive driving system. The meta-learning algorithm generates a common behavioral model, which is capable of fast adaptation using a small amount of driving data to assist optimal decision-making. We use a case study of an obstacle avoidance driving scenario to corroborate that the adapted human behavioral model can successfully assist the human driver in reaching the target destination. Besides, it saves driving time compared with a driver-only scheme and is also robust to drivers' bounded rationality and errors.
Technical Report on the Checkfor.ai AI-Generated Text Classifier
We present the CheckforAI text classifier, a transformer-based neural network trained to distinguish text written by large language models from text written by humans. CheckforAI outperforms zero-shot methods such as DetectGPT as well as leading commercial AI detection tools with over 9 times lower error rates on a comprehensive benchmark comprised of ten text domains (student writing, creative writing, scientific writing, books, encyclopedias, news, email, scientific papers, short-form Q&A) and 8 open- and closed-source large language models. We propose a training algorithm, hard negative mining with synthetic mirrors, that enables our classifier to achieve orders of magnitude lower false positive rates on high-data domains such as reviews. Finally, we show that CheckforAI is not biased against nonnative English speakers and generalizes to domains and models unseen during training.
Differentiable Optimization Based Time-Varying Control Barrier Functions for Dynamic Obstacle Avoidance
Dai, Bolun, Khorrambakht, Rooholla, Krishnamurthy, Prashanth, Khorrami, Farshad
Control barrier functions (CBFs) provide a simple yet effective way for safe control synthesis. Recently, work has been done using differentiable optimization (diffOpt) based methods to systematically construct CBFs for static obstacle avoidance tasks between geometric shapes. In this work, we extend the application of diffOpt CBFs to perform dynamic obstacle avoidance tasks. We show that by using the time-varying CBF (TVCBF) formulation, we can perform obstacle avoidance for dynamic geometric obstacles. Additionally, we show how to extend the TVCBF constraint to consider measurement noise and actuation limits. To demonstrate the efficacy of our proposed approach, we first compare its performance with a model predictive control based method and a circular CBF based method on a simulated dynamic obstacle avoidance task. Then, we demonstrate the performance of our proposed approach in experimental studies using a 7-degree-of-freedom Franka Research 3 robotic manipulator.
Experts believe 'panic buying' led to drug shortages for COVID, flu and RSV
A new report from the Milken Center for Public Health suggests "panic buying" of medications by patients and providers caused drug shortages. TRIPLE THREAT โ Amid COVID, flu and RSV, households and hospitals stockpiled meds, says new research. MORE MPOX โ CDC warns of new case clusters in Chicago. UNREALISTIC IDEALS โ Here's how AI defines the "perfect body." A new study by The Bulimia Project, a Brooklyn, New York-based website that publishes content and research related to eating disorders, investigated how AI perceived the "ideal" body based on social media data.
AI defines 'ideal body type' per social media โ here's what it looks like
Fox News correspondent Grady Trimble has the latest on fears the technology will spiral out of control on'Special Report.' Artificial intelligence has its own idea of what the perfect human body should look like. A new study by The Bulimia Project, a Brooklyn, New York-based website that publishes content and research related to eating disorders, investigated how AI perceived the "ideal" body based on social media data. The results, produced by AI-generated imaging tools such as Dall-E 2, Stable Diffusion and Midjourney, showed widely "unrealistic" body structures, as reported in a discussion of the findings on The Bulimia Project's website. Forty percent of the overall images depicted "unrealistic" body types of muscular men and women -- 37% for women and 43% for men -- according to the study.